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Abstract 

We study minimal vertex covers of trees. Contrarily to the num- 
ber N VC (A) of minimal vertex covers of the tree A, log N VC (A) is a 
self- averaging quantity. We show that, for large sizes n, lim n ^ +00 < 
log N VC (A) > n /n = 0.1033252 ± 10~ 7 . The basic idea is, given a 
tree, to concentrate on its degenerate vertices, that is those vertices 
which belong to some minimal vertex cover but not to all of them. 
Deletion of the other vertices induces a forest of totally degenerate 
trees. We show that the problem reduces to the computation of the 
size distribution of this forest, which we perform analytically, and of 
the average < logiVy C > over totally degenerate trees of given size, 
which we perform numerically. 

1 Introduction 

The vertex-cover problem, as other combinatorial problems, is arousing grow- 
ing interest in the fields of statistical physics and disordered systems. In 
particular, it helps to understand, and the machinery of optimization algo- 
rithms helps to solve, spin-glasses and random hamiltonian models (see [l] 
for a recent review of the problem, [2] for a critical analysis point of view). A 
possible question is : given a graph, what can be said about the size and num- 
ber of its minimum vertex covers ? Another approach consists in answering 
this question on average, for a given statistical ensemble of graphs. 

In this paper, we are concerned with average behaviour, and focus on 
the simple situation of trees. In this case, good algorithms are known, for 
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instance based on the so-called b-colorings (see f3] and sectEJ, to find the 
number N VC (A) or size of minimal vertex covers of a given tree A. In fact, if 
each tree of given size n has the same probability, then the average number of 
vertex covers can also be retrieved analytically by means of these b-colorings. 
However, this is not a self-averaging quantity for large n, and it would be 
desirable to find a thermodynamically extensive quantity giving a somewhat 
more physical insight into the number of minimal configurations of a random 
tree. 

We claim that < log N VC (A) > n is indeed self-averaging, and the reason 
is as follows. Suppose that we delete from A all the vertices which are not 
degenerate (that is, those which belong either to all the minimal vertex cov- 
ers of A or to none of them). Then we obtain (see sectJHJ) a forest with the 
same number of minimal vertex covers as A and whose vertices are all de- 
generate. Moreover, in this forest, the number of trees of given size scales 
thermodynamically with the size of A (see sectlH), and the probability of 
appearance of a given tree depends only on its size. In other words, as far 
as we are concerned with the number of minimal vertex covers, picking at 
random a tree on n > 1 vertices amounts for each i > 1 to picking with 
uniform law totally degenerate trees on i vertices. And, in turn, it is ex- 
pected that such a typical tree A verifies log N VC (A) « nJ2i c i < logiV TC >f, 
where < log N vc >f is the average of log N vc over totally degenerate trees on 

1 vertices. 

The computation thus reduces to that of the scaling parameters q for the 
size distribution, and to the evaluation of the average of log N vc over totally 
degenerate trees with given size (see sectJSJ. 

But let us begin with a remainder of some basic facts and the crucial 
theorem on b-colorings. 

2 Preliminary observations 
2.1 Basic definitions 

A graph is a pair A = (V, £) where V is a set with n > 1 elements (written 
|V| = n in the sequel) and £ is a subset of {{x, y} C V; x ^ y}. V is the set 
of vertices of A and £ the set of edges of A, n is the size of A, denoted by 
\A\. In this paper A is called a labeled graph if V consists of positive integers. 

Given two distinct vertices x, y of the graph A = (V, £), a path from x 
to y in A is a sequence {vq, vi}, {vx, V2}, • ■ • , {i> p -i, v p } of edges of A such 
that Vq — x, v p — y and v^ ^ Vj if i ^ j. A graph is called a tree if any 
two distinct vertices are connected by a unique path, and a forest if any two 
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distinct vertices are connected by at most one path. 

A rooted tree is a triple (V,£,r), such that (V,£) is a tree and r <EV. 

A vertex cover of the graph A = (V, £) is a subset of V containing at least 
one end of each edge of A. A vertex cover of A is minimal if there does not 
exist any other vertex cover with less elements. In the sequel, the number of 
minimal vertex covers of A is denoted N VC (A). 

2.2 Some useful results 

The exponential generating function of rooted trees It is defined 
as T(x) = ^2 A j^r, where the sum runs over all rooted trees. Cayley's 
formula states that the number of rooted trees on n vertices is hence 
= J2 n >i r ^nT xn an d this implies that T(x) = xe T ^ as can be deduced 
by a direct combinatorial argument relying on the recursive nature of rooted 
trees. 

A theorem on minimal vertex covers of trees It has been shown in 
|H1 that, for any tree A = (V,£), there exists a unique triple (B,TZ,G) C 
V x £ x V, called the b-coloring of A, such that 

• B,G and the set of end-vertices of 1Z form a partition of V . 

• The edges in 1Z are non-adjacent; the edges with one end- vertex in G 
have the other end- vertex in B\ each vertex in B is connected to G by 
at least two edges. 

Moreover, the b-coloring of A has the following connection with its min- 
imal vertex covers : B (resp. G) is the set of vertices contained in all (resp. 
none) of the minimal vertex covers of A. Consequently, any end- vertex of 7Z 
is contained in some minimal vertex cover of A but not in all of them : these 
vertices are called degenerate. 

An additional result is that any minimal vertex cover of A contains exactly 
one end-vertex of each edge in 71. Consequently, a vertex cover of A is 
minimal if and only if it contains \B\ + \1Z\ vertices. 

In the sequel, vertices in B and G and end-vertices of 1Z will be called 
respectively brown, green and red vertices, while edges in 1Z will be called 
red edges. A tree with no brown or green vertices is said to be red. 

3 Red forest of a tree 

Given a tree A = (V, £) and a non-empty set S C V of vertices, the forest 
induced by A on S is defined as (S, £') , where £' consists of those edges in 
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£ with both ends in S. If A is a tree with b-coloring (B,TZ,G), and such 
that 1Z 7^ 0, define the red forest of A to be the forest induced by A on the 
set of red vertices. Denote A\ = (Vi, Si), ■ ■ ■ ,A P = (V p , £ p ) the trees of that 
forest. Then it follows at once from the definitions that Ai has b-coloring 
(0, 1Z fl Si, 0), hence is red. But if C is a minimal vertex cover of A, C fl 
is a vertex cover of Aj. Since C contains exactly one end of each red edge of 
A, C nVi contains \TZC\ £j| vertices of Ai : it is a minimal vertex cover of A4. 
Now, given minimal vertex covers C\, ■ ■ ■ , C p of the Aj's, B U Ci U ■ ■ ■ U C p is 
a vertex cover of A (because an edge of A either is an edge of some Ai or has 
at least one end in B), which is minimal since it contains \B\ + \1Z\ vertices. 
It is in fact the only minimal vertex cover of A which coincides with Ci on 
each Ai, and this proves that : 

v 

N VC (A) = Y[N vc (Ai). 

i=i 

Let us define the size distribution of a forest F as the sequence D = 
(Di)i>i, where Di is the number of components of size i in F. Given two 
forests F 1; F 2 of red trees, with same size distribution D, there is no difficulty 
in proving that the numbers of trees with red forests respectively Fx and F 2 
are equal. In other words the number of trees on n vertices with given red 
forest F depends on F only via its size distribution D : this number shall be 
denoted vein) in the sequel. Note that ^c(n) = if Di ^ for some i > n. 

If we denote by Aj the sum over red trees R on i vertices of log N VC (R), 
the preceding remarks allow to write our sum over trees of size n as 

^\ogN vc {A) = Mn){Di^i + D 2 \ 2 + ■■■ + D n \ n ). 

A D 

We are thus led to the computation of the z/p's and Aj's. Note already that 
a red tree has even size, whence ^£>(n) = if D 2i+ \ 7^ for some i. We now 
come to the analytic computation of V£>{n). 



4 Size distribution 

Denote by G, B, R respectively the exponential generating functions for the 
number of rooted trees with root of color green, brown and red. For instance, 
G(x) = ^ A jjpx^, where the sum runs over all rooted trees with green root. 
The following relations hold between these generating functions (see |3] for 
details and the combinatorial meaning of U, Q) 
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U = xe B+R {e G -l) 

B = xe B+R (e G -l-G) 

R = x Qe B+R 

Q = xe B+R , 



leading in particular to B(x) = T(x) + T(—T(x)) — T(—T(x)) 2 . Now, let us 
look more closely at those trees with red root. The red forest of such a tree 
A has exactly one component containing the root, and the size s(A) of this 
component can be encoded in the following generating function, where the 
sum runs over rooted trees with red root 

Ro(x,y) = J2uu xlAl y s{A) - 

Since R = xyQ e B+Ro with Q = xye B+R \ it follows that R = T(2x 2 y 2 e 2B ^) /2, 
and the total number of red components of size 2p among labeled trees of 
size n is 

^n\[x] n [y} 2p Ro(x,y) = ^^n\[x] n . 2p e 2 ^ 

A straightforward application of the saddle-point method then shows 
that, for large n, the average number of red components of size 2p scales 
thermodynamically with n : C2 P (n) ~ C2 P n and 

= (2py^_ T>T 2 P -l e -2 P T*, 2T 2 _ (1) 
pi 

where, in the above formula, T(x) and its derivative are taken at the saddle- 
point x = —I. For large p, we get that tends to log(2eT 2 exp(-2T 2 )) w 
—0.0844424236, showing that c 2p decays exponentially. 

Now, we make the "thermodynamic limit" assumption that the number 
of trees with given size in the red forest of some random large tree is a self- 
averaging quantity. That is, we suppose that, for large n, the trees which 
contribute significantly to < logiV vc > n have indeed C 2 trees of size 2, C 4 
trees of size 4,- • •. The distribution v D hence becomes irrelevant, since it 
concentrates on one particular value, and the average becomes 



A 



2i 



lim < logiV„ c > n /n = V c 2i 

n-++oo ' iV 2 j 



where ~N 2i denotes the number of red trees of size 2i. 
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5 Minimal vertex covers of the red trees 



In this section, we compute analytically the number N 2p of red trees on 2p 
vertices and give a numeric estimate of the A2«'s. In fact, one could deduce 
directly R(x), whence the iV 2p 's, from the set of equations on B,G, R,Q,U 
stated in the preceding section. But we prefer to give a direct combinatorial 
derivation which, after slight adaptations, shall give also the total number of 
minimal vertex covers among red trees of size 2p. 

5.1 Overview 

As was already emphasized, a red tree A has an even number of vertices, say 
2p, and we associate to A its shrinked tree A as follows 

• The vertices of A are the red edges of A, so A has size p ; 

• Two vertices of A are connected in A if and only if the corresponding 
two red edges of A are connected by some other edge in A. 

This procedure is uniquely defined and, if the set of vertices of A is V, that of 
its shrinked tree is a partition of V into sets of 2 elements. Such a partition 
will be called a pairing of V : note that it consists of the red edges of A. 

Conversely, let V be a set (|V| = 2p). There are pairings of V, and 
p p ~ 2 trees with set of vertices equal to one of these pairings. Given such a 
tree B, the number of red trees on V with shrinked tree B is 4 P_1 , because 
each of the p — 1 edges of B leaves 4 possibilities for the corresponding edge 
of the red tree. Hence, the number of red trees on 2p vertices is 

p\ 

so the number 2pN 2p of rooted trees has exponential generating function 
R(x) = T{2x 2 )/2. 

Let us now enumerate the total number of minimal vertex covers among 
the red trees of size 2p. Consider a minimal vertex cover on a labeled tree A 
of size 2p. To encode this vertex cover, add an arrow at each covered end of 
each black edge (that is, each edge which is not red). By definition of vertex 
covers a black edge is either oriented (one arrow) or bi-oriented (two arrows). 

Now, we apply the shrinking procedure as defined above, but we keep 
track of the orientations : this leads to a tree on p vertices, each edge being 
either oriented or bi-oriented. 

Again this procedure is uniquely defined. If V is a set on 2p vertices, 
the number of trees with set of vertices a pairing of V and with edges either 
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oriented or bi-oriented is ^fp p_2 3 p_1 . Given one such tree B, the number of 
covered red trees A with shrinked tree B is 2 P . Indeed, each of the p vertices 
of B corresponds to a red edge of A, which may be covered in two ways. 
Once this choice has been made, the way the black edges connect the red 
edges with each other is completely constrained by their (bi-)orientation. 

Hence, the total number of minimal vertex covers over red trees of size 
2p is 3^(3p) p " 2 , and the average number of minimal vertex covers among 
red trees on 2p vertices is < N vc >5,= 2(3/2) p ~ 1 . 



5.2 Theoretical viewpoint 

Both for theoretical understanding and for numerical purpose, it proves useful 
to focus on rooted trees, and we denote by n+(A) (resp. n_(A)) the number 
of minimal vertex covers which contain (resp. do not contain) the root of the 
rooted red tree A. 

A red tree with root r may be seen recursively as an edge {r, r'}, with 
both ends connected to the root of arbitrarily many red rooted trees. And it 
is clear (see [2] for details) that a set S of vertices of A is a minimal vertex 
cover of A if and only if : (i) it induces a minimal vertex cover on each of 
these attached subtrees (ii) exactly one end of {r, r'} is not in S (iii) the edges 
incident at this vertex have the other end in S. Consequently, denoting by 
Ai the red trees attached to r and by A'j those attached to r' : 



n_(A) = n n +(^)II( n +(4)+M4)) 



(2) 
(3) 



Now, let us have a closer look at the generating function for rooted red 
trees R(x) = T{2x 2 )/2. As follows from the equation for T, R should be 
such that R(x) = x 2 e 2R ( x '. Combinatorially, this means that the number of 
rooted red trees on 2p vertices is 



(2p)l[x 



2p-2i 



^ k\ 

k>0 



E X \A\ 
A 1 1 




fc'>0 



\A\ 



2s w 



(4) 



where A ranges over rooted red trees. But building a rooted tree on n 
vertices amounts to choosing (i) the root r and the vertex r' with whom r 
shares its red edges (2p(2p— 1) ways) (ii) the numbers k and k! of rooted trees 
attached respectively to those vertices (iii) those trees themselves A%, ■ • • , A/. 
and A[, - ■ ■ , A' k ,, in such a way that their total number of vertices is 2p — 
2 (iv) finally, a relabeling of those trees which exhausts the labels ^ r, r' 
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({2p — 2)1/ (Yl \Ai\\ Yl \A'j\\) ways). Each term of the expansion of Y^a |^ry 
corresponds to a particular ordered choice in (iii), and the \/k\ factor just 
gets rid of this ordering. This is true also for the primed term, hence the 
combinatorial meaning of the equation for R is clear and we now apply it to 
our vertex covers problem. 

The set S of functions N 2 — > R is a vector space. If is such a function, 
and <fi(a,b) = x ab ,a,b E N, we write (p = Y. a ,b x ab(a,b). If ip = J2 a ,b x ab( a i b ) 
is another function, let their product be <p * if) = Y2a b a i y x a \,x' a iy{(io! ^ bb'). 
S is then an algebra, generated by the (a,b),a,b E N. Let a be the (al- 
gebra) morphism such that a(a, b) = (6, a) for all a, b and p the (vector 
space) morphism such that p(a,b) = (a + b,a). Then eqs. (|2|3ft rewrite 

{n + {A),n_{A)) = Ilp(MA),n_(4)) *UM n +{ A j)> n -( A j))- Hence > our 
remarks on the combinatorial meaning of eq. (J2J show that the formal power 

series R + _(x) = Y,Ai n +i A )^ n -( A ))lA\i 0De y s the equation 

R + _( X ) = x 2 e pi?+-(x)+ CT pi? + _(x)_ 

Of course, in this equation, the exponential is defined by its power series, the 
product being as defined above. 

Let fi m be the (algebra) morphism such that fi m (a, b) = a l b m for all a, b. 
Then fR + _(x) = x 2 e^ pR+ ^^ + ^ apR+ ~^ x \ so we have the following generating 
functions for rooted trees: 

^ , x\ A \ 

R lm ( X ) = 22 n +( A ) n -( A r' 



A 



A\\ 



x 2 exp f j R k 

,l+m—k 

+ ( h) Rk 

,l+m—k (5) 
\k=0 ^ ' k=0 ^ ' I 



For the first two values of n, the resulting system of equations is easily 
solved. For instance : 



For I = m = : R 0)0 (x) = x 2 e 2Ro ^ x \ so Rq, g {x) = T(2x 2 )/2 as expected. 

For I = l,m = or I = 0,m = 1 : Ri,o(x) = Ri t o(x) = x 2 e 3Rl >°( x \ so 
Ri,q(e) — T(3x 2 )/3, again in agreement with the formula above. 

And this seems to be the largest value of n for which the exact solution 
functions are retrievable. In the case where I + m — 2, the system reduces 
to an implicit expression for i? 2 ,o : #2,0 = x 2 exp (3i?2,o + 2R 2 fie~ R2 -°) , still 
allowing asymptotic computations. However, we have not found a systematic 
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k 


1 


2 


3 


M k 


0.20273 


0.41576 


0.63658 



Table 1: Moments M k = lim p _, +OQ ^- log < N% c >^ of the number of minimal 
vertex covers of red trees, as obtained from eq. J^j. 

treatment for the study of eq.JHJ) which would have been a possible starting 
point for the replica method. 

We now come to the numerical evaluation of the A 2 p's. 

5.3 Numerical computations 

Given a red tree A on 2p vertices, one can choose any of its vertices as a 
root and apply recursively equations (I2|3jl to compute n+(A),n-(A) in 0(p) 
time. However, the number of such trees increases exponentially with p, and 
systematic enumeration soon becomes a challenge. 

For small trees (p < 16), we compute the exact distribution of the number 
of minimal vertex covers. The algorithm is based on an exhaustive recursive 
enumeration of rooted trees followed by systematic unshrinking. 

For larger trees, we proceed as follows. The number of red trees with 
given shrinked tree A depends only on \A\, and every red tree on 2p vertices 
has a unique shrinked tree, which is of size p. Hence, to pick randomly a red 
tree on 2p vertices with uniform law, it suffices to : (i) Pick randomly a tree 
A on p vertices, with uniform law (this is conveniently done by means of the 
Priifer bijection between those trees and sequences of {1, • • • ,p}^ 1 '"'' p ~ 2 ^) (ii) 
Choose, again with uniform probability, one of the red trees with shrinked 
tree A. 

The number of samples picked for each size was chosen so as to ensure 
a precision of 10~ 7 on < logA^ c > fn. From the fact that < logA^„ c > n 
fn — ^ c 2i A 2i /A^2j, it follows that an error 5 2 i on ^-j^ leads to a maximum 
error c 2i 2i5 2i on < logN vc > n fn. From eq.JTJ we see that c 2i decays 
exponentially fast with i : in practice, we took 8 10 9 samples for each size 
17 < p < 45 and 1.5 10 8 samples for sizes 46 < p < 189. And this leads to 

lim < logN vc (A) > n fn = V c 2p < logiV vc >£ = 0.1033252 ± 10" 7 

Those numerical simulations also give evidence that, for red trees of large 
size 2p, the random variable X p = (log N vc )/{2p) is self-averaging. Indeed, 
for each of the sizes considered in the previous paragraph, it is possible to get 
the approximate distribution of X p , and it appears that (X p — < X p >)y/p 
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approaches a fixed gaussian distribution for large p. Numerically, we find 
lim p _ +00 < X p >= lim p _ +00 < logN vc >J /2p = 0.1963 ± 10" 4 , to be 
compared with the first few moments of table [TJ. In fact, approximating 
the first few M^'s by a rational function leads to estimate linip^oo < X p >= 
^j[^\ k=0 pa 0.196, a result remarkably close to the expected limit. Good un- 
derstanding of this self-averaging feature would certainly be a crucial issue 
in the exact derivation of linip^+oo < X p >, and presumably also of the cor- 
responding limit for general trees. 

/ am very grateful to Michel Bauer for interesting remarks and discussions. 
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